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ABSTRACT 

A total of 102 Spanish-speaking and 104 
English-speaking preschool 4-year-old pupils were individually 
administered the Spanish and English versions of the Cooperative 
Preschool Inventory (CPI) to examine item responses for possible 
bias. Miminum logit chi-square regression was applied to each of the 
30 items of the CPI verbal scale. Each of the verbal items was 
cross-classified by response (correct, incorrect), verbal achievement 
(low, middle, high), and language proficiency (monolingual Spanish or 
monolingual English). A total of four items exhibited a significant 
chi-square for language proficiency in the model. In each case, the 
English speaking students' probability of a correct response was 
higher than the Spanish-speaking students. The only item displaying 
an achievement by language proficiency interaction effect was the 
question, "If you wanted to find a lion, where would you look?" 
Spanish-speaking children of low achievement had a higher probability 
(24%) of answering this item correctly than low achieving 
English-speaking children (4%). To know if these differences are, 
indeed, more than just potential bias would require follow-up 
interviews and further examination of the items. Future research may 
help to verify if there is bias in the items of the Cooperative 
Preschool Inventory. Tables and figures are included. (NEC) 
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Abstract 

A total of 102 Spanish-speaking preschool four-year-old pupils 
and 104 English-speaking four-year-old pupils were individually 
administered the Spanish and English versions of the 
Cooperative Preschool Inventory (CPI). Minimum logit chi- 
square regression was applied to each of the 30 items of the 
verbal scale of the CPI. Several differences in the items of the 
Spanish and English versions of the CPI are noted. 
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The fidelity of translations of psychological scales has been a concern of 
researchers. High quality translations allow the examination of psychological 
constructs in different cultures and in groups speaking different languages. 
Hulin, Drasgow, and Parsons (1983) summarized four types of translations: 
(a) accurately in the target language, (b) the aesthetic-poetic translations 
where the primary purpose is to evoke moods, feelings, and affect in the target 
language, (c) the ethnographic translation in which a major aim is to maintain 
the cultural content of the source language, and (d) the linguistic translation 
which is concerned with the equivalence of meanings of both morphemes and 
grammatical forms of the two languages. The purpose of this study was to 
examine the item responses to Spanish-speaking and English-speaking preschool 
children on the verbal scale of the Cooperative Preschool Inventory 
(Caldwell, 1970, 1974) to determine if there was potential item bias in the test. 

In addition to examining items for possible bias, this study utilized logit 
regression as the statistical procedure. Logit regression may be used to analyze 
contingency tables and to use a dichotomous response, converted to a continuous 
scale, as the dependent variable. The independent variables may be measured on 
a nominal, ordinal, or interval scale. Such flexibility provides logit regression an 
advantage over contingency table analysis. 



IThe authors appreciate the comments of Patricia B. Jones, Division of 
Educational Foundations, University of Arizona, on an earlier version of this 
paper. 
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METHOD 

Subjects 

The study consisted of two independent samples of preschool four-year-old 
pupils enrolled in the same remedial education program in the fall of 1982, 1983, 
and 1984 in a large, urban school district in the Southwest. The first sample 
consisted of 102 Spanish-speaking pupils (42 boys and 60 girls). The ethnic 
background of this sample comprised 1 Black, 2 Native American, and 
99 Hispanic children. The second sample consisted of 104 English-speaking pupils 
(44 boys and 60 girls). The ethnic background of the second sample was 15 Black, 
10 Native American, 1 Asian, and 78 Hispanic children. 

Instrument 

The English CPI (Caldwell, 1970) is an individually administered English 
language inventory of school-readiness. A Spanish translation of the CPI 
(Caldwell, 1974) is used in many programs to assess the school readiness of 
Hispanic pupils. Although the CPI was published over a decade ago, it is 
currently marketed. The Spanish translation may be called a pragmatic 
translation since the primary purpose of the translation is to communicate 
accurately in the target language. The Spanish version of the CPI is a direct, 
literal translation of the English which is administered individually by a Spanish- 
speaking examiner. The CPI is administered in about 15 minutes and pupil 
responses are scored as correct or incorrect. The CPI consists of 64 items which 
are grouped into two subscales: (a) a verbal scale of 33 items, and (b) a motor 
scale of 35 items. Four items of the CPI are considered part of both the verbal 
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and motor subscales. This instrument is designed to be a brief assessment and 
screening procedure for individual use with children in the age range from three 
to six years. It is employed variously as a screening device, a school-readiness 
measure, an achievement test, and an evaluation instrument. Many school 
districts use the CPI to identify those individuals unprepared for traditional 
programs. 

Previous research has supported the reliability and validity of the CPI. 
Powers and Medina (1984) reported alpha reliability estimates of .92 for the 
English CPI and .90 for the Spanish CPI. In a later study, Powers and Medina 
(in press) reported that the factor structure of the inventory for Spanish and 
English versions were similar. 



Logit degression 

Minimum logit chi-square regression was proposed by Berkson (1944, 1953) 
as an alternative to maximum likelihood estimation ol regression coefficients in 
the case of the logistic response model (Duncan 1978). Discussions of logit 
regression may be found in Snedecor and Cochran (1967), Theil (1970), and 
Hanushek and Jackson (1977). 

For a dichotomously scored variable y which is related to a vector of 
variables x, the probability that y = 1 is given by the logistic function 

P(Y=1) = i 

then P(Y = O) = Q = 1 - P 
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In order to estimate the parameters d and j3 of the equation, one may 
take 

logit P = In £ = a + j3x 
where 

P = population proportion of Y = 1 responses 

a = the population intercept of the regression line 

/? = the vector of true slopes of the straight line 
It is assumed that the xs are observations, p is the observed value of P, which is 
a random binomial variable distributed around the true P at x with variance = 
PQ/n. Berkson (1944) showed that the method of least squares could be used to 
estimate the parameters of the logistic function. Berkson obtained a minimum 
logit chi-square estimate based on minimization of the following algorithm: 

X 2 (logit) = £npq(l-t) 2 

where 

n = number of subjects at x 

p = 1-q = the observed proportion 

1 = Infp/q) = the observed logit 

1 = ln(p/q) s a+bx = the estimated value of the logit 

where 

1 

p = 

1 + e -(P + 0x) 
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The purpose, then, is to obtain a weighted least squares solution of the straight 

A 

line 1 = a + bx because the logit transformation is intrinsically heteroskedastic 
violating an important assumption of ordinary least squares. In the case of 
heteroskedasticity, OLS does create unbiased j3 although the V ( j3 ) is biased. 

Hanushek and Jackson (1977) described the application of minimum logit 
chi-square regression to a contingency table and Duncan (1978) showed how logit 
regression could be implemented with SPSS. "The log of the odds transformation 
converts the probability estimates to a continuous unbounded variable which 
becomes the dependent variable in a linear model with the category definitions 
as explanatory variables." (Hanushek and Jackson, 1977, p. 191). Logit 
regression is appropriate where the dichotomous response is regressed on any 
combination of categorical independent variables and interval-level independent 
variables. If the interval independent variable were instead a categorical 
variable, then some method of multiway contingency table analysis would be 
appropriate. With logit regression, main and interaction elxects on responses can 
be explored with both nominal, ordinal, and interval independent variables. 

Procedure 



in 



Pupils entering the preschool program were tested individually i 
October 1982, 1983, and 1984 with the Spanish or English CPI. These language 
versions were administered approximately one month after the beginning of 
school so that the child would become accustomed to the new surroundings and 
to the teacher. Further, the teacher was able to observe the students' language 
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production in a natural setting and to determine the child's predominant 
language. 

Students were categorized as monolingual Spanish or monolingual English 
based on teacher observation of students 1 language production. Students 
identified us bilinguals were excluded from this study. The total score of the 
CPI verbal scale was used to place students in three ordered categories of 1) low 
achieving (1-20), 2) middle achieving (21-40), and 3) high achieving (41-64). 
Response x achievement x language crosstabulation tables were prepared for 
each item of the CPI Verbal Scale. The crosstabulation tables were entered into 
the SPSS regression program following the procedures in Duncan (1978). 

Pupils in this study were from lower socioeconomic levels. Eligibility for 
this educational preschool program which focused on raising reading, language 
arts, and mathematics skills included the following criteria for participation: (a) 
the chiJd must be the sibling of an older educationally disadvantaged child, (b) at 
least one parent of the child lacks a high school education, (c) the child 
participates in a free lunch program, and (d) the child has limited proficiency in 
English. Therefore, the possible confounding variable of socioeconomic status 
was somewhat controlled. 

RESULTS AND DISCUSSION 
Each of the verbal items was cross-classified by response (correct, 
incorrect), verbal achievement (low, middle, high), and language proficiency 
(monolingual Spanish, monolingual English). Language proficiency (monolingual 
Spanish or monolingual English) of the child was recorded by the classroom 
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teacher after the observation of the child for a month in the preschool program 
and an examination of school records. 

Frequencies of correct and incorrect responses of a 3 (achievement) x 2 
(language proficiency) table were entered into an SPSS regression program with 
the natural logarithm of the ratio of correct response to incorrect response (log 
of odds, or logit) as the dependent variable. An achievement by proficiency 
interaction term was computed which, along with achievement and proficiency 
variables, was entered as a predictor variable in the regression equation. A full 
model was estimated and logit chi-sqaure values were calculated from the 
standard SPSS output using procedures described by Duncan (1978). 

A chi-square test of the effect of achievement on the probability of a 
correct response was computed for each item. A significant chi-square would 
indicate a difference in the probability of a correct response for individuals at 
different achievement levels. Another chi-square test of the effect of language 
proficiency on the probability of a correct response was computed for each item. 
This significant chi-square would be interpreted as the effect of language 
proficiency on the probability of a correct response. A chi-square test of the 
effects of the interaction of achievement and language on the probability of a 
correct response was also computed. A significant chi-square would indicate a 
differential effect on the probability of a correct response as a function of 
achievement level and language proficiency. 

Typical results of item chi-squares are given in Figures 1 and 2. An 
example of an item exhibiting a significant chi-square of the effects of 
achievement on the probability of a correct response is presented in Figure 1. In 
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this example, neither the chi-square test for language proficiency nor the chi- 
square test for interaction was significant. A substantive interpretation of a 
significant chi-square of achievement would be that the probability of a correct 
response increases with verbal ability. Figure 1 also displays the observed logits 
and the least squares regression line. Figure 2 shows an item with significant 
chi-square for interaction. Fiom this graph, it is clear that the probability of a 
correct reponse for low achieving Spanish-speaking students exceeded that of 
low achieving English-speaking students. High achieving English-speaking 
students had a higher probability of answering this item correctly than did the 
high ability Spanish-speaking student. 

A total of 27 items revealed significant chi-square tests of the effects of 
achievement on probability of a correct response. In each case, the probability 
of a correct response increased with verbal achievement levels as would be 
expected. Items without a significant chi-square for achievement were 1, 24, 28, 
36, 38 and 40. The reason item 1 did not have a significant chi-squa/e fro 
achievement was that it was an easy item students at each achievement level 
generally answered it correctly. Refer to Table 1 for a listing of all the verbal 
items of the CPI and chi-square values. 

A total of four items (2, 19, 2i, 24) exhibited a significant chi-square for 
language proficiency in the model. In each case the EnglLh speaking students 
probability of a correct response was higher than the Spanish-speaking students. 
2. How old are you? ^Cuantos anos tienes? 

19. If you were sick who would you go tc? ^Si estas enfermo, a quien 
vas a ver? 
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21. If you wanted to buy some gas where would you go? ^Si quieres 

comprar gasolina, a donde vas? 
24. If you wanted to find a lion, where would you look? ^Si quieres 

hallar (encountrar) un leon, donde lo buscas. (Caldwell 1970, p. '-'-7; 

1974, p. 4-8) 

The only item displaying an achievement by language proficiency 
interaction effect was items 24. This item is presented in Table 2 as the 
percentage of children within each cell of the achievement by language 
proficiency matrix. Item 24 was "If you wanted to find a lion where would you 
look?" (Caldwell, 1970, p. 7)(or Si quieres hallar (encontrar) un leon, donde lo 
buscas?)(Caldwell, 1974, p. 8). Spanish-speaking children of low achievement had 
a higher probability (24%) of answering this item correctly than low achieving 
English-speaking children (4%) and high achieving English-speaking children 
(96%) appeared to have known this item better than high ability Spanish-speaking 
children (43%). 

To Know if these differences are, indeed, more than just potential bias 
would require follow-up interviews and further examination of the items* Future 
research may help to verify if there is bias in the items of the Cooperative 
Preschool Inventory. 
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Table 1 

Chi-square Tests of Independence for Achievement (Ach), Language Proficiency 
(Pro) and Interacts (Int) for each Verbal Item of the Cooperative Preschool 
inventory. 



Chi-square 



Item 



Ach 



Pro 



Int 



1 
2 
3 
6 
7 
8 
19 
20 
21 
22 
23 
24 
23 
26 
27 
28 
29 
3C 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
48 
56 
57 
58 



.03 


1.35 


.25 


15.46*** 


4.76* 


.90 


10.00** 


.01 


.31 


1 1 .45*** 


.88 


.48 


13.08 *** 


2.07 


.45 


10. ;>8 ** 


1 .17 


2.73 


10.35 ** 


4.59* 


.62 


6.53* 


.27 


.14 


15.13*** 


6.61 ** 


3.20 


9.78 * 


.00 


.03 


' 1 77 


.02 


.72 


.51 


11 .23*** 


14.23*** 


27.10*** 


.36 


2.72 


13.49*** 


.02 


.09 


17.36*** 


.54 


2.22 


2.80 


.00 


.33 


35.05*** 


.17 


1.42 


5.17* 


.10 


.26 


11 .83*** 


2.00 


.76 


24.67 *** 


3.34 


1.69 


23.14*** 


.62 


2.65 


11.74*** 


.00 


.42 


10.88**** 


.04 


.03 


.23 


2.37 


3.44 


14.73*** 


1.45 


1.S2 


1.54 


.04 


.02 


12.09*** 


.09 


.55 


.91 


.51 


1.04 


20.40**** 


.73 


.23 


8.02** 


.33 


1.40 


20.31 *** 


.02 


.03 


11.63*** 


2.56 


.41 


6.24* 


.07 


.13 



Note. 



*X 2 _(1) 
**X ,0) 
***X (1) 



3.84, p< .05. 
6.64, p < .01. 
10.83, p< .001. 
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Table 2 

Percentages of Correct Responses within Cells of Achievement x Language 
Categories of Item 24 with Significant Interaction. 



Item 


Language 


Low 


Middle 


High 


Chi-sqaure 


24 


E 


4% 


45% 


96% 


14.23*** 




S 


27% 


29% 


43% 





Note. E = Monolingual English group, S = Monolingual Spanish group. 
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Figure 1. Regression of Response of Item 29 on Achievement by Languaqe Proficiency 
Legend x = Monolingual Spanish 
D = Monolingual English 
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Figure 2. Regression of Response of Item 24 on Achievement by Language Proficiency. 
Legend x = Monolingual English 
G3 = Monolingual Spanish 
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APPENDIX A 

SPSS Program Cards from Duncan's (1978) Procedure for Calculating Minimum 
Logit Chi-square Regression as applied to one item of this study. 



VARIABLE LIST 
INPUT MEDIUM 
INPUT FORMAT 
N OF CASES 
COMPUTE 
COMPUTE 
COMPUTE 
WEIGHT 
REGRESSION 

READ INPUT DATA 
FINISH 



XI, X2, A, B 
CARD 

FIXED (4F2.0) 
6 

Y = LN ((A+5)/(B+.5)) 

\VT = (A + 5)*(B+.5)/(A+B+I) 

X1X2 = X1*X2 

WT 

VARIABLES = Y, XI, X2, X1X2 
REGRESSION = Y WITH XI, X2, X1X2(3) 



Note. With all three variables entered in the multiple regression equation, the 
chi-square values are obtained by multiplying the F value for each 
variable by the residual mean square. 
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